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1 Introduction 


Phylogenetic networks are rooted, directed acyclic graphs whose leaves are labelled by some set of species (see Section 2 
for precise definitions of the concepts that we introduce in this section). Such networks are used by biologists to represent 
the evolution of species that have undergone reticulate events such as hybridization and there is much recent work on 
these structures (cf. e.g. Gusfield, 2014; Huson et al, 2010). In Huber and Moulton (2006) a close relationship is 
described between phylogenetic networks and multi-labelled trees (MUL-trees), leaf-labelled trees where more than one 
leaf may have the same label. Essentially, it is shown that it is always possible to “unfold” a phylogenetic network N 
to obtain a MUL-tree U(N) and that, conversely, a MUL-tree T can under certain conditions be “folded” to obtain a 
phylogenetic network F(T) that exhibits T. We illustrate these operations in Fig. 1 (see Section 3 for more details). The 
tree T in (i) is a MUL-tree, and the network in (iii) is obtained by first inserting vertices into T and then folding up the 
resulting tree by identifying vertices in T to obtain F(T); the unfolding U(N) of N (which is essentially obtained by 
reversing this process) is precisely T. 







Fig. 1 (i) A MUL-tree T, and (iii) the folding F(T) of T. with (ii) the intermediate tree T' used to guide the folding. 


Applications of the operations F and U include the construction of evolutionary histories of polyploids in terms 
of phylogenetic networks (Lott et al, 2009; Marcussen et al, 2015). In particular, polyploid organisms contain several 
copies of a genome, and if a tree is constructed from these genomes (or specific genes in these genomes) a MUL- 
tree can be obtained by labelling each leaf by the species that has the corresponding genome. By folding this MUL- 
tree a representation of the evolution of the species can then be obtained (in terms of a phylogenetic network), from 
the evolutionary history of the genomes. In this representation, vertices in the network with indegree two represent 
hybridisation events, where two parent species have produced a child which has the combined set of genomes of both of 
its parents. 

In this paper, we study properties of the F and U operations in some detail and, in the process, show that they 
have some interesting connections with other areas such as gene tree/species network reconciliation (Wu and Zhang, 
2011; Zhang et al, 2011) and the theory of graph fibrations (Boldi and Vigna, 2002). To do this, we begin by reviewing 
the concepts of MUL-trees and phylogenetic networks in the next section, and present some general properties of the 
folding and unfolding operations in Section 3. We then consider the interrelationship between the folding and unfolding 
operations. 

More specifically, although it is always the case that U(F(T)) is isomorphic to T for any MUL-tree T (Huber and 
Moulton, 2006), the same situation does not apply if the U and F operations are applied in the opposite order to some 
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network as there are networks N for which F(U(N)) is not isomorphic to N (we give an example shortly in Fig. 2). 
Therefore, it is of interest to understand the networks N for which F(U(N)) is isomorphic to N. We call these stable 
networks. In Section 4, we present a characterization for stable phylogenetic networks (see Theorem 1). Using this result 
we are then able to show that the well-known class of binary, tree-sibling networks as defined in Cardona et al (2009) are 
stable (see Corollary 1). We expect that stable networks could be of interest as they can provide a canonical representative 
for the set of all networks that display a particular MUL-tree (cf. Pardi and Scornavacca, 2015, for choosing canonical 
representatives of networks that display a set of trees). 

In Section 5, we show that the unfolding and folding operations are closely related to concepts that arise in the theory 
of graph fibrations (cf. Boldi and Vigna, 2002, for a review of this area). In particular, we define the concept of a folding 
map between a MUL-tree and a phylogenetic network. As one consequence, we show that the unfolding of a network 
can be considered as a phylogenetic analogue of the universal cover of a digraph. This allows us to provide an alternative 
characterisation for stable networks (Corollary 3). It is woth noting that an alternative framework for considering maps 
between phylogenetic networks in developed in Willson (2012). 

We then focus on the problem of displaying trees in networks. In Section 6, we demonstrate that it is NP-complete 
to decide whether or not a phylogenetic tree is displayed by a stable network (Theorem 5). This is of interest since in 
Kanj et al (2008) it is shown that it is NP-complete to decide if a tree is displayed by a network, but in van Iersel et al 
(2010) it is shown that this problem is polynomial for certain special classes of networks (such as normal and tree-child 
networks). 

Finally, in Section 7, we define and study a new way in which a tree may be displayed in a network: We say that 
a phylogenetic tree is weakly displayed by a phylogenetic network N if it is displayed by the MUL-tree U(N). Using 
the concepts developed in Section 5, we provide a characterization for when a tree is weakly displayed by a network in 
terms of a special type of tree reconciliation (Theorem 6). This characterisation allows us to show that, in contrast to 
displaying a tree, it is possible to decide in polynomial time whether or not a phylogenetic tree is weakly displayed by a 
phylogenetic network having the same leaf-set (Corollary 4). 


2 Definitions 

Throughout this paper, we let X denote a finite set of size at least two. In addition, all graphs that we consider are 
connected. 


2.1 Rooted DAGs 

Suppose G is a directed acyclic graph in which multiple arcs are allowed and which has a single root, denoted by pc- 
We denote the set of vertices of G by V(G) and the set of arcs by A(G). If v is a vertex of G, then the in-degree of v, 
denoted by indeg(v), is the number of incoming arcs of v, and the out-degree of v, denoted by oiitdeg(v), is the number 
of outgoing arcs of v. We say that a vertex v € V(G) is below a vertex w € V(G) — {v} if there exists a directed path 
from w to v in G. We call a vertex v of G a reticulation vertex of G, if outdeg(v ) = 1 and indeg(v) > 2 holds. We call v 
a tree vertex of G if indeg{v) = 1 and either outdeg(y) = 0 holds, in which case we call v a leaf of G, or outdeg(v) > 2. 
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We denote the set of leaves of G by L(G) and the set of interior vertices v of G, that is, v is neither the root nor a leaf of 
G, by V(G). 

2.2 MUL-trees 

We say that a multi-set M is a multi-set on A if the set resulting from M by ignoring the multiplicities of the elements in M 
is X. Following Huber and Moulton (2006), we define a pseudo multi-labelled tree S? on X, or a pseudo MUL-tree on X 
for short, to be a pair (T, %) consisting of a rooted directed tree T together with a labelling map % : X —► &{S) = 2 s — {0} 
from X into the set &(S) of non-empty subsets of the leaf set S = L(T) of T such that 

(i) for all x,y GX distinct %(x) n A(y) = an d 

(ii) for every leaf s € S there exists some x £ X with s £ x( x )- 

We call T a MUL-tree on X if it does not have any vertices with in-degree one and out-degree one. For example, in 
Fig. 1, the tree in (i) is a MUL-tree on X = {a,b,c,d}, and the tree in (ii) is a pseudo MUL-tree on X, but it is not a 
MUL-tree on X.. If the map % is clear from the context then we will write T rather than (T -X) and if the set X is of 
no relevance to the discussion then we will call T a MUL-tree rather than a MUL-tree on X. We say that two (pseudo) 
MUL-trees (T\ ./Xi ) and (72, £ 2 ) on X are isomorphic if there is a digraph isomorphism % : V(Ti) —> V(T 2 ) such that, for 
all x £ X and v £ V(Ti), we have v £ Xi (x) if and only if E, (v) £ X 2 {x). 

Suppose T is a pseudo MUL-tree. For v a non-root vertex of T, we denote by T (v) the connected subgraph of T that 
contains v obtained by deleting the incoming arc of v. Clearly T (v) is a pseudo MUL-tree. We call a pseudo MUL-tree 
T' a pseudo subMUL-tree of T if there exists a non-root vertex v of T such that T(y) and T' are isomorphic. For T 
a MUL-tree we say that a subMUL-tree T' of T is inextendible if there exist distinct vertices v and v' of T such that 
T' = T (v) and T (v) and T (v') are isomorphic. Loosely speaking, a subMUL-tree of T is inextendible if T contains more 
than one copy of that subMUL-tree. We say that a subMUL-tree T' of T is maximal inextendible if T' is inextendible 
and any other inextendible subMUL-tree T" of T that contains T' as a subMUL-tree is isomorphic with T'. 

To illustrate these definitions consider for example the MUL-tree T and its folding F(T) depicted in Fig. l(i) and 

(iii) , respectively. Then the three leaves labelled b are all inextendible subtrees of T and so are the two leaves labelled 
c. Each one of two subtrees of T of length two (ignoring the directions of the arcs of T) that have leave set {b.c\ is 
maximal inextendible. 

2.3 Phylogenetic networks 

An X-network N is a rooted directed acyclic multi-graph such that 

(i) there exists a unique root p,v of N that has in-degree zero and out-degree at least two, 

(ii) every vertex of N except the root is either a reticulation vertex or a tree vertex, 

(iii) there exists no vertex of in-degree one and out-degree one, and 

(iv) the set L(N) of leaves of N is X. 

A phylogenetic network (on X) is an A-network that does not have multi-arcs. A phylogenetic tree on A is a phylo¬ 
genetic network on A that has no reticulation vertices. We say that a phylogenetic network N is binary if the degree of 
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every reticulation vertex and every non-leaf tree-vertex is three and outdeg(p n) = 2. Finally, we say that two X-networks 
N and N' are isomorphic if there exists a bijection K : V (N) —>■ V (N') such that for all vertices m, v £ V(N) the number of 
arcs in N with head u and tail v equals the number of arcs in N' with head k(u) and tail ic(v), and K is the identity on X. 

3 Folding and unfolding 

In this section, we recall the unfolding and folding operations mentioned in the introduction that were first proposed in 
Huber and Moulton (2006) (see also Huber et al, 2012, for the binary case). 

We first describe the unfolding operation U which constructs a pseudo MUL-tree U*(N) from an A"-network N as 
follows: 

- the vertices of U* (N) are the directed paths in N that start at p n, 

- there is an arc from vertex K in U*(N) to vertex n' in U*(N ) if and only if n' = na (i.e. Ka is the path K extended 
by the arc a), and 

- the vertices in U* (N) that start at p v and end at some x in X are labelled by x. 

The MUL-tree obtained by suppressing all in-degree one and out-degree one vertices in U*(N), if there are any, is 
denoted by U ( N ). In Huber and Moulton (2006) it is shown that U(N) is indeed a MUL-tree. 

We denote for all vertices v of an directed graph G as in Section 2.1 the set of children of v by <:h(v) and say that 
an X-network IV exhibits a MUL-tree T if the MUL-trees U(N) and T are isomorphic. In particular, any X-network N 
exhibits the MUL-tree U (N). Note that there exist MUL-trees T for which there is no phylogenetic network that exhibits 
T (for example, the binary MUL-tree with two leaves both labelled by the same element). 

We now describe the folding operation F for constructing an A"-network F(T) from a MUL-tree T (cf. (Huber and 
Moulton, 2006, p. 628) for more details and Fig. 1 above for an illustration), which can be thought of as the reverse 
of the unfolding operation U. We first construct a pseudo MUL-tree T 1 from T which will guide this operation. To do 
this we need to define a sequence x : T = T \. f), ... of pseudo MUL-trees. Suppose i > 1 is such that we have already 
constructed tree 7,. Then we obtain T l+ \ as follows: If there is no inextendible subMUL-tree of 7), we declare 7} to be 
the last tree in x. Otherwise, we take a maximal inextendible subMUL-tree of 7,. Let v be the root of this tree and let S v 
be the subset of vertices w of 7} with 7}(w) isomorphic to 7}(v). Then, to obtain T i+ u for each w £ S v — {v} we remove 
the subtree 7}(vv) and the arc with head w from 7,. If this has rendered the root p-/ l of T, a vertex with out-degree one then 
we collapse the remaining arc with tail p 7/ . Otherwise, we suppress the resulting vertex with in-degree and out-degree 
one. 

Now, to obtain 7 ; , we consider each tree in x other than T in turn. Let i > 2 and assume that v£ V (T) is such that 
Tj is constructed from 7) ]. Then we subdivide all of those arcs a in T for which T(h T (a)) and T’(v) are isomorphic 
(as pseudo MUL-trees). The pseudo MUL-tree obtained once the last element in x has been processed is T T . Finally, to 
obtain F(T), we define an equivalence relation ~ r t on V(T^) that identifies all pairs of vertices v,w in V(T f ) with 7^(v) 
isomorphic with T^(w) (as pseudo-MUL-trees), and let F(T) be the A'-nctwork obtained by taking the quotient of 7’ 1 
by ~ r t. More precisely, let G(T) denote the DAG with vertex set {[k]~ t : u £ V (7’ T )} and (multi)-set of arcs obtained 

by joining any two vertices n, v £ V(T^) for which [h]~ t 4 1 [v]~ t holds by m > 0 arcs ([m] _, [v]~ ^) if and only if for 

one (and hence for all) u' £ [u] the size of ch(u') IT is m. The A'-nctwork obtained from G(T) by suppressing all 

vertices of indegree one and outdegree one and defining the leaf labels in the natural way is F(T). 
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For example, consider the MUL-tree T depicted in Fig. l(i). Then the pseudo MUL-tree displayed in Fig. 1 (ii) is T'. 
The two vertices labelled u make up [w]~ t and the vertex in F(T), depicted in Fig. l(iii), representing [k]~ t is labelled 
u. Similarly, the two vertices labelled vv in 7’ make up [w]~ r+ which we again represent in F(T) in terms of w. Clearly 
^ M~ r t anc * \ch{u') (T [w]~ 1 1 = 1 holds for all it' £ [m]~ t - Hence, there is precisely one arc in F(T) from u to 

w. 

Note that any MUL-tree T is isomorphic with U(F(T)) (as MUL-trees) (Huber and Moulton, 2006). Thus, if there 
is no risk of confusion we will sometimes identify T and U(F(T)). Also, note that if T is binary, then F(T) is semi- 
resolved, that is, every tree vertex in F(T) has out-degree 2. Moreover, in Huber and Moulton (2006, Proposition 3) 
it is shown that if F(T) is semi-resolved, then F(T) has the minimum number of reticulation vertices amongst all 
phylogenetic networks that exhibit T. 

In general, the folding of an arbitrary MUL-tree on X need not be a phylogenetic network. An example for this is 
again furnished by the MUL-tree with leaf set the multi-set {a,a}. We now characterize those MUL-trees T for which 
F(T) is a phylogenetic network. For clarity of exposition we denote, for any rooted directed acyclic graph G in which 
multiple arcs are allowed and any a = ( u,v) in G by li(a) = Iiq{o) = v its head and by t(a) = tc{a) = u its tail. 

Proposition 1 Suppose T is a binary MUL-tree on X. Then F(T ) is a phylogenetic network if and only if there is no 
pair of distinct vertices \\w in T which share a parent in T and are such that T (v) and T (w) are inextendible. 

Proof We prove the claim that if F(T) is a phylogenetic network then there is no pair of distinct vertices in T with 
the stated property by establishing the contrapositive. Suppose T is a binary MUL-tree on X that contains two distinct 
vertices v and w which share a parent in T and are such that T(v) and T(w) are inextendible. Without loss of generality, 
we may assume that v and w are such that there exist no two vertices v' and w' of T on the directed paths from the root 
of T to v and w, respectively, such that T{v') and T (w') are inextendible and the parent of v' is also the parent of w'. If 
7» is maximal inextendible then F{T) contains a multi-arc and so F(T) is not a phylogenetic network, as required. 
So, assume that T (v) is not maximal inextendible. Then there must exist some vertex v" in T distinct from v and w such 
that T(v") is maximal inextendible and 7’(v) is a subMUL-tree of T(v"). 

Let zo = v",zi, ■ ■ ■ ,zi, l > 0, denote the vertices on the directed path from v" to v such that T (zi) is inextendible and 
is rendered maximal inextendible during the folding of T. Then zi = v must hold as every MUL-tree T(zi), 0 < i < l, 
must contain both T(v) and T(w) as a subMUL-tree. Thus, T(v) is rendered maximal inextendible at some stage in 
the construction of F(T). Applying the operation F to T(v) introduces a multi-arc into F(T) and thus F(T) is not a 
phylogenetic network, as required. 

Conversely, suppose that T is a binary MUL-tree on X such that there exist no two vertices v, w in T which share a 
parent in T such that T (v) and T (w) are inextendible. Assume for contradiction that FIT ) is not a phylogenetic network. 
Then F(T) must contain multi-arcs a and a!. Put v = t(a) = t(a') and w = h(a) = h{a'). Then v is a tree vertex and w is 
a reticulation vertex of F(T). Let z denote the unique child of w in F(T). Note that since the folding operation implies 
that F(T) cannot contain an arc both of whose end vertices are reticulation vertices, z must in fact be a tree vertex in 
F(T). 

Now, let 7 , / denote two directed paths from the root Pizj , of F(T) to z which contain arcs a and a', respectively, 
and which differ only on those arcs. Then the subMUL-trees T’(y) and T (/) of the MUL-tree U ( F{T )) are isomorphic 
so that, in particular, T ( 7 ) and 'L if) are inextendible. But this is impossible, since there is a directed path 7 " from Pf(t) 
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to v such that y" is the parent of both 7 and 7 ' in U(F(T)) which is isomorphic to T. Thus, F(T) must be a phylogenetic 
network. □ 


As mentioned above, the folding operation F can be considered as the reverse of the operation U. However, there 
exist phylogenetic networks A such that F(U (A)) is not isomorphic to A (see e.g. Fig. 2). Therefore, it is of interest to 
understand those networks A for which F(U (A)) and A are isomorphic. 




Fig. 2 (i) A phylogenetic network N, (ii) U(N), and (iii) the X-network F(U(N)). Clearly, N and F(U(N)) are not isomorphic. 


4 Stable networks 

In this section, we shall give a characterization of phylogenetic networks A for which F(U (A)) is isomorphic to A. We 
call such networks stable. 

We start by recalling the definition of an irreducible network (Huber and Moulton, 2006). Suppose that A is a 
phylogenetic network on X. We call two distinct tree vertices v and w in A identifiable if there exist directed paths y v 
from the root p n of A to v and y w from p n to w such that the subMUL-trees T ( 7 ,,) and T (y v „) of U ( N ) are isomorphic. In 
addition, we say that N is irreducible if it does not contain an identifiable pair of tree vertices. To illustrate, the network 
N depicted in Fig. 2(i) is not irreducible, since the two vertices v and w are identifiable. 

If A is a phylogenetic network then let Ret{N) denote the set of reticulation vertices of A. We call A compressed 
if the child of each vertex in Ret(N) is a tree vertex. Note that in Cardona et al (2009), this property is taken as part 
of the definition of a phylogenetic network, the rationale being that we cannot expect to reconstruct the order in which 
hybridization events occur. 

Theorem 1 Suppose that A is a semi-resolved phylogenetic network. Then the following are equivalent. 

(i) A is stable. 

(ii) A is compressed and irreducible. 

(iii) A is compressed and there does not exist a pair of distinct tree vertices v,w in A such that ch(v) = ch(w). 

Proof (ii) => (iii): Suppose that (ii) holds and assume for contradiction that there exists a pair of distinct tree vertices v, w 
in A such that ch{v) = ch(w). Then ch(v) C Ret(N). Since A is semi-resolved we have |c/t(v)| = 2. Let {a,b} = ch(v). 
Since A is compressed the children a' and b' of a and b, respectively, are tree-vertices of A. Let y v , and y", denote 
the directed paths from the root py of A to a 1 that cross v and w, respectively. Similarly, let yf and yf denote the 
directed paths in A from p n to b' that cross v and w, respectively. Then the subMUL-trees T(yf) and T(yf,) of U (A) are 






Katharina T. Huber et al. 


isomorphic and so are the subMUL-trees T(y' h ,) and T(y^i). Let v denote the subpath obtained from y v , by terminating 
at v ; . Similarly, let fi denote the subpath obtained from y" : , by terminating at w. Then the MUL-tree obtained from T(y' a ,) 
and T(y' b ,) by adding the vertex labelled v and the arcs ( v,y' a ,) and ( v,y' b ,) is T(v). Similarly, the MUL-tree obtained 
from T(jy) and T(y$) by adding the vertex labelled ji and the arcs (n,y%,) and is T(p). Since T(v) and T(fi) 

are clearly isomorphic it follows that v, w is an identifiable pair in A. Hence A is not irreducible which provides the 
required contradiction. 

(iii) => (ii): Suppose that (iii) holds and assume for contradiction that A is not irreducible. Then A contains an 
identifiable pair of vertices v.w. Without loss of generality, we may assume that v and w are such that there are no 
vertices v' and w’ below v and w, respectively, that also form an identifiable pair. 

To obtain the required contradiction, we first claim that ch(v) and ch(w) are contained in Ret{N). Suppose that 
s £ ch(v). For all non-root vertices it of A let y„ denote a directed path from the root py of A to u. If s is a leaf of A then, 
since v and w are an identifiable pair, the MUL-trees T (y v ) and T (y w ) are isomorphic and the underlying bijection is the 
identity on X. Hence, s £ ch(w) holds too and, so, s £ Ret(N) which is impossible as .visa leaf of N. 

If s is a non-leaf tree-vertex of N then, since T (y v ) and T {y w ) are isomorphic and every tree vertex z of A gives rise 
to a subset of vertices in the MUL-tree U (/V), it follows that there exists a non-leaf tree vertex s' below w such that T (y s ) 
and T (y/) are isomorphic. By the choice of v and w, we cannot have that s and s' form an identifiable pair and so s = s' 
must hold. Hence, s £ Ret(N ), which is impossible as s is assumed to be a tree vertex of N. Since every non-root vertex 
of N is either a tree-vertex or a reticulation vertex of A, it follows that ch(v) C Ret(N). Similar arguments imply that 
ch(w) C Ret(N) also holds which completes the proof of the claim. 

To complete the proof, assume for contradiction that there exists some s £ ch(y) — ch(w). Then, s £ Ret(N), by the 
previous claim. Since N is compressed, the child s 1 of s must be a tree-vertex of N. Since T (y v ) and T (y w ) are isomorphic 
it follows that there exists a tree vertex r in N below w such that T(y s i) and T(y r ) are isomorphic. Note that s' ^ r as 
otherwise s' must be a reticulation vertex of N which is impossible. Hence, s' and r form an identifiable pair in N with 
s' below v and r below w which is impossible in view of the choice of v and w. Thus, ch(v ) C ch(w). Similar arguments 
imply that ch(w) C ch(v) and so ch(w ) = ch(v) must hold, as required. But this is impossible in view of (iii). 

(i) => (ii): This follows by Huber and Moulton (2006, Theorem 3). 

(ii) => (i): Suppose that N is compressed and irreducible. Let N b and F(JJ{N)) b denote some binary resolution of N 

and F(U(N)), respectively. Since N is irreducible so is N 1 ', and since N exhibits U(N) so does N b . Hence, by applying 
Huber and Moulton (2006, Corollary 2) to N b and F(U[N)) b and using the assumption that N is compressed, it follows 
that N is stable. □ 

As an immediate corollary (Corollary 1) of this last theorem, we see that the collection of binary, stable phylogenetic 
networks contains a well-known class of phylogenetic networks. More specifically, suppose that A is a phylogenetic 
network. A vertex w of N distinct from some vertex v of A is a sibling of v if v and w share the same parent, and a sibling 
that is a tree vertex is called a tree-sibling vertex. In addition, A is called a tree-child network if every non-leaf vertex of 
A has a child that is a tree vertex of A (Cardona et al, 2008), and A is called a tree-sibling network if every reticulation 
vertex of A has a tree-sibling (Cardona et al, 2009). Note that a tree-child network is a tree-sibling network. 


Corollary 1 Suppose A is a binary compressed tree-sibling network. Then A is stable. 
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Note that there exist semi-resolved, compressed tree-sibling networks that are not stable (Fig. 3(i)), binary, sta¬ 
ble phylogenetic networks that are not tree-sibling (Fig. 3(ii)), and non-binary, tree-child networks that are not stable 
(Fig. 3(iii)). 






Fig. 3 (i) The network onX = {a.b. r.d) is semi-resolved, compressed tree-sibling but not stable, (ii) The network on X = { a , h. c} is binary, stable 
but not tree-sibling, (iii) The network on X = {n, b, c, d} is non-binary, tree-child but not stable. 


5 Folding maps 

In this section, we explore a relationship between the folding/unfolding operations and graph fibrations. For simplicity, 
we shall follow the presentation of the latter topic in Boldi and Vigna (2002). Results from this section will be used to 
establish a main result in Section 7. 

Recall that the head of an arc a in an A"-network N is denoted by hy(a') and its tail by f ; y(a). Now, suppose that 
(T.x) is a pseudo MUL-tree on A and that /V is a phylogenetic network on X. An X-morphism f : T —>■ N is a pair of 
functions fy : V(T) —> V(N), f a : A(T) — > A(N) such that (i) for all a £ A(T), we have /jAt(/i(a)) = fv{hr{a)) and 
0 v(/a(p)) = and (ii) if v G L(T) with v £ x{ x )> x £ X, then fy(v) = x. An A-morphism / is called a rooted 

X-morphism if fviP'r) = Pv also holds. In case the context is clear, we denote both fy and f,\ by /. We call an X- 
morphism / :T —>■ N a folding map 1 if both maps /v,/a are surjective, and for each arc a £ A (N) and v £ V(T) such 
that f(v) = t(a) there is a unique arc a v £ A(T) (the lifting of the arc a at v) such that f(a v ) = a and t(a v ) = v. Note 
that a folding map is necessarily a rooted A-morphism. We call the inverse image / '(v), v £ V(N), the fibre over v. 
Informally, the fibre over v is the subset of V(T) that is mapped to v under /. For example, for the tree T ] and the 
phylogenetic network N depicted in Fig. 1 (ii) and (iii), respectively, the fibers of the vertices u, v and w in N are given 
by the vertices of T " labelled with the same letters. 

We begin by stating a result which illustrates how folding maps naturally arise from the unfolding U ( N) of a network 
N. This result is an analogue of Boldi and Vigna (2002, Theorem 15); the proof is quite similar and straight-forward and 
so we omit it. 

Theorem 2 Let N be a phylogenetic network on X. Then the map f* : U* ( N ) —> N that takes each vertex tz in U* ( N ) to 
its last vertex, and each arc ofU*{N) to the corresponding arc in N is a folding map. 

1 A folding map is analogous to an "opfibration” for digraphs (cf. Boldi and Vigna, 2002, Definition 4). 
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As we shall now show, the folding F(T) of a MUL-tree T can also give rise to a folding map. In analogy with Boldi 
and Vigna (2002, p. 25), we say that an equivalence relation ~ on the vertex set V(T) of a pseudo MUL-tree T satisfies 
the local out-isomorphism property (LOIP) if the following holds for all v, w £ V(T). 

- LOIP: If v ~ w then there is a bijection q from the set of arcs in T with tail v to the set of arcs in T with tail w such 

that h(a) ~ h(£, (a)), for all arcs a in T with tail v. 

We now use the LOIP-property to characterize when equivalence relations on MUL-trees give rise to folding 
maps (cf. Boldi and Vigna, 2002, Theorem 2). To aid clarity of presentation, we denote the parent of a non-root vertex 
v in a rooted directed tree by pa{v). 

Theorem 3 Suppose that (I'-X) is a pseudo MUL-tree on X, and ~ is an equivalence relation on V(T). Then the 
equivalence classes [,]^ are the fibres of a folding map f : T —► A (for some phylogenetic network N on X) if and only 
if ~ satisfies the following five properties: 

(i) LOIP, 

(ii) for all v £ V(T) with in-degree and out-degree 1, |[v]^| > 2, and pa(y) f pa(v") for some v" £ [v]~, 

(iii) for all v £V{T) with in-degree 1 and out-degree not equal to 1, pa(v) ~ pa(v’) for all v’ £ [v]~, 

(iv) for all x £X and v £ % ( x ), [t']~ = %(x), and 

(v) for all v £ V(T) — {pr}, pa(v) /= pa(v') for all v' £ [v]^ distinct from v». 

Proof Assume first that A is a phylogenetic network on X and / : T —)• N is a folding map such that the equivalence 
classes [,]^ of ~ are the fibres of /. For each v, w £ V(T) with v ~ w, define a map % from the set of arcs a in T with tail 
v to the set of arcs in T with tail w by putting |(a) equal to f{a) w . Then f(h(^(a))) = f(h(f(a) w )) = h(f (f (a) w )) = 
h(f(a)) = f(h(a)). Hence h(fq (a)) ~ h(a), and so ~ satisfies (i). Moreover, as A is a phylogenetic network, it is straight¬ 
forward to check that (ii) must hold as no vertex in A can have in-degree and out-degree 1, (iii) must hold as every vertex 
of A that is not the root of A is either a reticulation vertex or a tree vertex (but not both), and that (iv) must hold as all 
elements in x(x) must be mapped by / to a vertex labeled by x which has in-degree 1. Finally, (v) follows from the fact 
that A does not contain multi-arcs. 

Conversely, assume that ~ is an equivalence relations on V (T) that satisfies properties (i)-(v). To simplify notation, 
put [it] = [(/]-.. for all vertices u in V{T). Let T/~ be the network obtained by taking the quotient of T by ~ (as described 
in Section 3). In particular, T /~ is a rooted DAG with vertex set V(7’)/~, and ([it], [v]) an arc in T /~ for m,v £ V(T) 
if and only if (u',v') £ A(T) for some u' £ [ u\ and v' £ [v] (note that this definition is independent of the choice of 
u' and v'). In addition, we identify each leaf [u] in T/ ~ with the necessarily unique element x in X with [u] = x( x ) 
whose existence follows from property (iv). It is straight-forward to check that properties (i)-(v) ensure that T/ ~ is a 
phylogenetic network on X. 

Now, define f: T —> T/ ~ to be the A-morphism that maps each vertex u in V(T) to its equivalence class [it], and 
each arc (u,v) in A(T) to the arc ([it], [v]). It is straight-forward to check that / is a folding map as properties (i) and (iv) 
imply that / yields a well-defined surjective A-morphism from T to 7’/ ~ that satisfies the aforementioned arc lifting 
property. □ 

Given a MUL-tree T, consider the equivalence relation ~y+ on the vertex set V (7’ : ) of the pseudo MUL-tree 7 r 
defined in Section 3. Since ~ r t satisfies properties (i) - (v) of the last theorem it follows that, in case F(T) is a 
phylogenetic network, we obtain a folding map T —► F(T) = T " / ~ r t whose fibres are the equivalence classes of ~ r t. 
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We now state a result that provides additional insight into unfoldings of networks, and that will also be useful in the 
last section. It can be regarded as a phylogenetic analogue of path lifting in topology (cf. also Boldi and Vigna, 2002, 
Theorem 13 and Corollary 14). 

Theorem 4 Suppose that T and T' are pseudo MUL-trees on X, that N is a phylogenetic network on X and that g :T' 

N is an X-morphism. If f : T N is a folding map, then there exists an X-morphism g : T' —► T such that fog = g. 
Moreover, if g is a rooted X-morphism, then so is g, and g is necessarily unique. 

Proof Using a top-down approach, we define g recursively as follows. Since / is a folding map, there exists a vertex 
u in f^ 1 {g(pT'))- We set g(pT') = w. Now, if the map g has been defined on the parent v' of some v £ V(T' ) as well 
as the arcs and vertices on the directed path from p T i to v', and a = (V,v) £ A(T'), then we define g(a) = g(a)^ v '\ 
and g(v) to be the head of this arc in T. It is straight-forward to check that the mapping g that we obtain in this way 
yields an X-morphism with the desired property. Moreover, if g is a rooted X-morphism, then py = g(pr') and hence 
/ 1 (siPr 1 )) = {Pt}- This implies that g is a rooted X-morphism, and that g is the only such map. □ 

As a corollary of this result, we now see that the pseudo MUL-tree U* ( N) can be regarded as a phylogenetic analogue 
of the universal total graph of N (at py), a graph theoretical variant of the universal cover of a topological space (cf. 
Boldi and Vigna, 2002, Section 3.1). 

Corollary 2 Suppose that T' is a pseudo MUL-tree and N is a phylogenetic network, both on X, and that g : T' N is 
a folding map. Then T' is isomorphic to U*(N). 

Proof Applying Theorem 4 with T = U*{N) and f = f* : U*(N) —>• N, it follows that there exists a unique rooted X- 
morphism g :T' —» U* ( N ) with fog = g. Since g is a folding map, it follows that g is also a folding map, and hence an 
isomorphism, as required. □ 

Using again the notation for a guidetree for the operation F, we now use this last result to provide an alternative 
characterisation for stable networks. 

Corollary 3 Suppose that N is a phylogenetic network. Then N is stable if and only ifU*(N) is isomorphic to [t/(V)]^. 

Proof Suppose N is stable, that is, N is isomorphic to F(U(N)). By the comment following Theorem 3, there exists a 
folding map from the pseudo MUL-tree [U (V)]^ to F(U ( N )). As N is isomorphic to F(U ( N )), there also exists a folding 
map from U*(N) to N. By Corollary 2, it follows that U*(N) is isomorphic to [t/ (2V)] . 

Conversely, suppose U*(N) is isomorphic to [I/(V)]^ and write rather than ~[j/pv)]t- By Theorem 2 we have a 
folding map f* : U* (N ) —>• N. Hence, by Theorem 3, there exists an equivalence relation on V (U* (N )) such that N is 
isomorphic to t/*(V)/~*. Moreover, u v in V(U*(N)) if and only if the pseudo MUL-trees U*(N)(u) and U*(N)(v) 
are isomorphic. 

Now, F(U(N)) is isomorphic to [t/(V)] 1 '/where u' v' in V([I/(V)] 1 ' if and only if [I/(V)] 1 '(m') is isomorphic 
to [[/(AOJ^v'). Therefore, the two equivalence relations and are equal (up to the isomorphism between U*(N) 
and [U (V)] 1 ), and hence N is isomorphic to F(U (V)), as required. □ 

Note that our definition for folding maps can be extended to obtain folding maps between X-networks in general. 
We will not pursue this possibility further here, but it could be of interest to understand categorical properties of such 
maps (cf. Boldi and Vigna, 2002, Section 6). 
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6 Displaying trees in stable networks 

Following van Iersel et al (2010), we say that a phylogenetic network A on A displays a phylogenetic tree T on X if there 
is a subgraph A' of A that is a subdivision of T (i. e. N' can be obtained from T by replacing arcs ( u , v), w, v £ V(T r ) by 
directed paths from u to v). We illustrate this concept in Fig. 4. 





Fig. 4 The phylogenetic tree in (i) is displayed by the network in (iii), but the tree in (ii) is not. 


In Kanj et al (2008) it is shown that it is NP-complete to decide whether or not a given phylogenetic tree is displayed 
by a given phylogenetic network. On the other hand, in van Iersel et al (2010) it is shown that there are polynomial 
algorithms for this problem for certain classes of networks e. g. binary tree-child networks. Thus it is of interest to 
know the complexity of this question for stable phylogenetic networks. We show that the following decision problem is 
NP-complete. 

TreeDisplaying 

Instance: A binary stable phylogenetic network on X and a binary phylogenetic tree on X. 

Question: Is T displayed by A? 

To establish this fact, we show that this problem is NP-complete when restricted to compressed, binary tree-sibling 
networks and apply Corollary 1. In the proof, we shall use the following operation, which is a modification of an 
operation with the same name defined in van Iersel et al (2010). Suppose that A is a binary phylogenetic network on 
X and that R is a binary phylogenetic tree on X. Let p,y denote the root of A, let pi< denote the root of R, and let 
v € V (A). Assume that x v ,x[,. p v ,q v ,p v are pairwise distinct vertices not already contained in A and that x v , x[„ p v and p v 
are also not contained in R. Then the operation HangLeaves(v) adds the vertices x v .x[,.p v .q v .p v to A as well as the 
arcs (Pv,PaO, (Pv,Pv), ( p v ,q v ), (PvAv) and ( q v ,x v ). In addition, it adds the vertices x v ,x[,,p v ,p v to R as well as 

the arcs (p v ,Pr), (p v ,Pv), (Pv,x v ), and (p v ,x[,). 

Theorem 5 TreeDisplaying is NP-complete, even when restricted to the class of binary, compressed tree-sibling 
networks. 

Proof By Corollary 1, it suffices to restrict attention to the class of binary, compressed tree-sibling networks. Let T be a 
binary phylogenetic tree on X and let A be a binary phylogenetic network on A. We will (in polynomial time) modify A 
to a binary, compressed tree-sibling network A* on some leaf set A* that contains A and, simultaneously, modify T to a 
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binary phylogenetic tree T* on X*. For T* and N* we then show that T is displayed by N if and only if T* is displayed 
by N*. The result then follows as it has been shown in Kanj et al (2008) that it is NP-complete to decide whether or not 
a binary phylogenetic tree is displayed by a binary phylogenetic network. 

The construction of N* is in two steps. In the first step, we repeatedly apply the operation HangLeaves to transform 
N into a compressed tree-sibling network on some yet to be specified leaf set X' and T to a binary phylogenetic tree on 
X'. To do this we associate to N a phylogenetic network N\ in which every reticulation vertex has a unique child and 
that child is a tree-vertex. This is achieved by carrying out the following operation. For each arc e in N whose head is a 
reticulation vertex of N we subdivide e by a new vertex v e and then apply HangLeaves to v e . We denote the resulting 
rooted DAG by N\. Note that N\ is clearly a binary phylogenetic network on X, every reticulation vertex of N\ has a 
unique child, and that child is a tree-vertex. Furthermore, every reticulation vertex of N i that is also a reticulation vertex 
of N has two siblings in N\ both of which are reticulation vertices. 

Next, we follow the proof of van Iersel et al (2010, Theorem 3) and choose for every reticulation vertex v of N i that 
is also a reticulation vertex in N one of its two siblings. Let s denote that sibling. Let p s denote the joint parent of 5 and v 
in /V]. Then we subdivide the arc (p x .s) of N\ by a new vertex v s and apply HangLeaves to v s . We denote the resulting 
DAG by N 2 . Note that v s is a tree-sibling of v in AA, and that x! v is a tree-sibling of q v , in AA. Let X* denote the union of 
X and all of the leaves added to N this way. Then it is easy to check that the resulting DAG N* is a binary, compressed 
tree-sibling network on X*. Moreover, the phylogenetic tree T* constructed in concert with N* is clearly binary and has 
leaf setX*. 

We now establish our claim that T is displayed by N if and only if T* is displayed by N*. To do so, we first show that 
T is displayed by N if and only if T' is displayed by N' where N' and T' are a phylogenetic network and a phylogenetic 
tree on X' , respectively, that are the result of a single application of operation HangLeaves, to a vertex v of N. 

Assume first that T is displayed by N. To see that T' is displayed by N' note first that there exists a subgraph N" 
of N that is a subdivision of T. Combined with the fact that the subgraph of N' with vertex set x v ,x[,,p Vl q v ,p v ,pN and 
arc set (p v ,pAr), ( p v ,p v ), ( p v ,q v ), (p v ,x' v ) and ( q Vl x v ) is a subdivision of the subtree of T" of T' whose vertex set is 
x v ,xf v ,p v ,p v ,pT and whose arc set is (p,.,pr), (p,.,Pv). (p v ,x v ), and (p v .x[ t ), it is easy to see that N" gives rise to a 
subgraph of N' that is a subdivision of T'. Thus, T' is displayed by N'. 

Conversely, assume that T' is displayed by N', that is, there exists a subgraph N" of N' that is a subdivision of 
T'. Clearly, the restriction of N" to V(N") — {x v .x' v .p v .q v ,p v } is a subgraph of N that is a subdivision of T' restricted 
to V{T') — {x v ,xf v ,p v ,p v }, that is T. Thus, T is displayed by N which completes the proof of the claim. A repeated 
application of the last claim implies that T is displayed by N if and only if T* is displayed by A*. □ 


7 Weakly displaying trees 

Given a phylogenetic tree T and a network N on X, we say that T is weakly displayed by N if it is displayed by U ( N ) 
(that is, there exists a subgraph of U ( N ) that is a subdivision of T). For example, both of the trees in Fig. 4 are weakly 
displayed by the phylogenetic network N, but the tree in (ii) is not displayed by N. As we shall see, this concept is 
closely related to the problem of reconciling gene trees with species networks. In Section 6, we studied the problem of 
displaying trees in networks, in particular showing that it is NP-complete to decide whether or not a binary phylogenetic 
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tree T is displayed by a phylogenetic network N even if it is stable. In this section, we show that, in contrast, one can 
decide in polynomial time whether or not a tree is weakly displayed by any given phylogenetic network. 

Before presenting our algorithm, we first derive a characterization for when a tree is weakly displayed by a phylo¬ 
genetic network in terms of so-called tree reconciliations. Given a phylogenetic network N on X , let V tr (N) be the set 
consisting of the tree vertices in V(N) together with the root on N. Following Zhang et al (2011), a reconciliation map 
between a phylogenetic tree T on X and a phylogenetic network N is a map r : V{T) — > V(N) such that r(v) £ V tr (N) 
for all v £ V(T), r(x) =x holds for all x £ X, and every arc (u. v) in A (T) is associated with a directed path P r (u,v) in A' 
with initial vertex r{u) and terminal vertex r{v). 

We now give the aforementioned characterization for when a tree is weakly displayed by a phylogenetic network. 
We call a reconciliation r between T and N locally separated if for each pair of vertices Vi and V 2 in T that have the 
same parent v, both P,-(v, vi) and P r (v, v’ 2 ) contain at least one arc, and the initial arc in P r (v, vi) is distinct from the initial 
arc in P r (v, V2). 

Theorem 6 Suppose that N is a phylogenetic network on X. Then a phylogenetic tree T on X is weakly displayed by N 
if and only if there exists a locally separated reconciliation between T and N. 

Proof We first prove that if there is a locally separated reconciliation r between T and N, then T is weakly displayed 
by N. We illustrate the main idea of the proof in Fig. 5 - essentially, the map r induces an A'-morphism r* from a 
subdivision T* of T into N, and so, using Theorem 4, we obtain an A'-morphism r from T* to U*(N), from which we 
can then deduce that T is displayed by U(N). 

More specifically, suppose that r is a locally separated reconciliation between T and N. Since each arc in T is 
associated with a directed path in N which contains at least one arc, it follows that r induces an A'-morphism r* from 
a subdivision T* of T to N. By Theorem 4, let r be an A'-morphism from T* to U*(N) such that f* or = r*. Since r is 
locally separated, it follows that the map f is injective, and hence T* is isomorphic to a subgraph of U*(N). 


U(N ) 



Fig. 5 The relationship between the maps described in the proof of Theorem 6. 


Now, consider the set Vo C V(T*) that is the pre-image of the in-degree one and out-degree one vertices in U*(N ) 
under r. Then, since f is an A'-morphism, each vertex in Vo has in-degree one and out-degree one. Let T' be the tree 
obtained from T* by suppressing all vertices in Vo. Then T' is a subdivision of T. Since U(N) does not contain any 
in-degree one and out-degree one vertices and U* (N) is a subdivision of U ( N ) it follows that T' is isomorphic to a 
subdivision of U*(N). Thus, T is displayed by U(N), and so T is weakly displayed by N, as required. 
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Conversely, suppose that T is weakly displayed by N. Then there exists a subdivision T' of T such that T' is 
isomorphic to a subgraph of U(N). Since U*(N) is a subdivision of U(N), there exists a subdivision T* of T' (and 
hence also a subdivision of T) such that T* is isomorphic to a subgraph of U*(N). Denote the A'-morphism from T* to 
U*(N) induced by this isomorphism by r* and let f* be the folding map from U*(N) to N given by Theorem 2. Then 
the A'-morphism f* o r* from T* to N induces a map r from V(T) to V (N) defined by putting r(v) = f* or* (v), for all 
v £ V(T). Clearly, r(v) exists because r*(v) is contained in V(U*(N)) and thus in V(U(N)) as r* is an A'-morphism. 
Moreover, since /* and r* are A'-morphisms it follows that r(v) is a tree vertex of N. 

Now, for every arc (u,v) in T, denote the subdivision of (m,v) in T* by P„ A (that is, P uv is the necessarily unique 
path from u to v in T*) and let P ' r (u, v) be the image of P uv under f* or*, a directed path from r(u) to r(v) in N which 
contains at least one arc. Then it follows that r is a reconciliation between T and N. Moreover, to see that r is locally 
separated, consider an arbitrary pair of distinct vertices v\ and V 2 in V(T) that have the same parent v. Denote the initial 
arcs of the two (necessarily distinct) directed paths P V _ V) and P V V2 in T* by a\ and 0 . 2 , respectively. Furthermore, for any 
arc aofU ( N ) put t(a ) = /[/(at) (a) and h(a ) = hu(N){a). Since r* is induced by an isomorphism between T* and a certain 
subgraph of U*(N ), we obtain r*(a\) f r*(a 2 ). Combined with t(r*(a\)) = t(r*(a 2 )) = v and Property (v) in Theorem 3 
it follows that f*(h(r*(a 1 ))) 7 ^ f*(h(r*(a. 2 )))- Therefore P, (v, vi) and P r (v,V 2 ) contain distinct initial arcs, from which 
it follows that r is locally separated, as required. □ 

In light of the last result, deciding whether or not a phylogenetic tree is weakly displayed by a phylogenetic network is 
equivalent to the following decision problem: 

Locally separated reconciliation 

Instance: A phylogenetic network N on X and a binary phylogenetic tree T on X. 

Question: Does there exist a locally separated reconciliation between T and N? 

We now present a dynamic programming algorithm to solve this problem. Let A be a phylogenetic network on X and 
let T be a binary phylogenetic tree on X. Then for every tree vertex v in N we denote by N(v) the phylogenetic network 
obtained from by N by first restricting N to v and all the vertices of N below v and then suppressing any resulting vertices 
with in-degree one and out-degree one. In addition, we define a function T : V(T) x V(N) —> {0,1} as follows. If v is 
not a leaf in V(T), then we set x (v,n) = 1 if and only if there exists some u! £ V tr (N) such that (i) u' = n or u! is below 
u in N, and (ii) there exists a locally separated reconciliation between T (v) and N(u'). If v is a leaf with label x, then we 
set t(v, u) = 1 if and only if u is a leaf in N labeled with x or x is a leaf in N below u. We remark that t(v, 11 ) = 1 implies 
that t(v, u* ) = 1 holds for all u* such that u is below u* in N. 

By definition, there exists a locally separated reconciliation between T and N if and only if x(pt,Pn) = 1. In order 
to compute the value of t(p t ,Pn), we will use the following result concerning the function x. 

Proposition 2 Let T be a binary phylogenetic tree on X, and N a phylogenetic network on X. Suppose that v is an 
interior vertex in T with two children Vi and V2 and u £ V(N). Then X (v,n) = 1 if and only if u is an interior vertex in 
N with x(v,u') = 1 for a child u' of u, or there exist two distinct children n\,ii 2 of u in N such that t(vi,ki) = 1 and 
X(V2,U2) = 1. 

Proof We begin by establishing the ‘if’ direction. Note that if x(v,u') = 1 holds for a child u' of u then, by the previous 
remark, x(v,u) = 1 follows. Therefore we may assume that u is an interior vertex in N with two children u\ f U 2 in N 
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such that t(vi,«i) = 1 and z(y 2 ,U 2 ) = 1. This implies that there exist two (not necessarily distinct) vertices u\ and u' 2 
in N such that for i = 1,2, there exists a locally separated reconciliation /, between T (v,) and /V(m') . Fix a directed path 
Pi in N obtained by combining the arc (m, m,) and an arbitrary path from m,- to u\. Since u\ 7 ^ mi the paths Pi and Po both 
contain at least one arc and their respective first arcs are distinct. 

Now consider the map f :V(T (v)) —>■ V ( N(u )) defined, for all v' GV(T (v)), by f(v') = 11 if v' = v, f(v') = f\ (V) if 
v' is contained in P(vi), and f(v') = faiy 1 ) otherwise. Since vi and V 2 are the two children of v and Py(v,v,-) = P, holds 
for i = l,2 and Yf{v' ,v") = Pj^v' ,v") holds for each arc (v',v") in T(v ,-) it follows that / is a reconciliation between 
T(v) and N(u). Combined with the fact that f\ and fa are locally separated, it follows that / is also locally separated. 
Hence, z (v,u) = 1, as required. 

Conversely, suppose that t(v, u) = 1 for v an interior vertex in T and 11 € V(T ). We may further assume that t(v, u') = 
0 for each child u! of u as otherwise the proposition clearly follows. Under this assumption, it follows that there exists a 
locally separated reconciliation / between T(v) and N(u) with /(v) = u. 

Now, let u'i = f(yi) for i = 1,2 (where u\ is not necessarily distinct from u' 2 ). Since iy is a child of v and / is a locally 
separated reconciliation, it follows that «' is below u and that = 1. Considering the two directed paths P/ (v, i',) 

which have the same starting vertex v but distinct initial arcs, it follows that there exist two distinct children u\ and 112 
of u such that u\ is contained in N(uj) for i = 1,2. Together with t(v;, m)) = 1, this implies = 1, as required. □ 

The above proposition forms the basis of a dynamic programming algorithm for computing z( pt,Pn) in polynomial 
time, which we now briefly describe. 

Let m = V(P)|, n = |V(N)|, and let k be the maximum number of children that any vertex in N may have. Note 
first that a topological ordering {vi,..., v,,,} of V(T) (that is, a linear ordering of V(T) such that v; is below vy in T 
implies j > i), can be computed in O(m) time. Similarly, we can compute a topological ordering {u\ ,..., u„} of V(N) in 
O(kn) time. Now, noting that v m = pr and u n = p^, consider the mxn matrix whose (i,y)th entry is T(v,-,My). Then by 
Proposition 2, it takes 0(mnk) time to fill this matrix and, therefore, to compute T(py,p,v). Since a binary phylogenetic 
tree T on X has 2\X\ — 1 vertices (Semple and Steel, 2003, Proposition 1.2.3), we summarize this last discussion in the 
following corollary. 

Corollary 4 Suppose that T is a binary phylogenetic tree on X, and that N is a phylogenetic network on X. Then, using 
a dynamic programming algorithm, it can be decided in 

0(|X|-|V(AO|- max \ch(v)\) 
vev{N) 

time whether or not T is weakly displayed by N. 

Acknowledgements VM and MS thank O. Gascuel for organizing the “Mathematical and Computational Evolutionary Biology” meeting, June 2012, 
Hameau de l’Etoile, France, where first ideas for the paper were conceived. 


References 


Boldi P, Vigna S (2002) Fibrations of graphs. Discrete Mathematics 243(l):21-66 

Cardona G, Llabres M, Rossello F, Valiente G (2008) A distance metric for a class of tree-sibling phylogenetic networks. Bioinformatics 24(13): 1481— 


1488 



Folding and unfolding phylogenetic networks 


17 


Cardona G, Rossello F, Valiente G (2009) Comparison of tree-child phylogenetic networks. IEEE/ACM Transactions on Computational Biology and 
Bioinformatics (TCBB) 6(4):552-569 

Gusfield D (2014) ReCombinatorics: The Algorithmics of Ancestral Recombination Graphs and Explicit Phylogenetic Networks. MIT Press 
Huber K, Moulton V (2006) Phylogenetic networks from multi-labelled trees. Journal of mathematical biology 52(5):613-632 
Huber KT, Moulton V, Spillner A, Storandt S, Suchecki R (2012) Computing a consensus of multilabeled trees. In: Proceedings of the Meeting on 
Algorithm Engineering & Experiments (ALENEX 12), SIAM, pp 84-92 
Huson DH, Rupp R, Scornavacca C (2010) Phylogenetic Networks: Concepts, Algorithms and Applications. Cambridge University Press 
van Iersel L, Semple C, Steel M (2010) Locating a tree in a phylogenetic network. Information Processing Letters 110:1037-1043 
Kanj I, Nakhleh L, Than C, Xia G (2008) Seeing the trees and their branches in the network is hard. Theoretical Computer Science 401(1): 153-164 
Lott M, Spillner A, Huber KT, Petri A, Oxelman B, Moulton V (2009) Inferring polyploid phylogenies from multiply-labeled gene trees. BMC 
evolutionary biology 9(1):216 

Marcussen T, Heier L, Brysting AK, Oxelman B, Jakobsen KS (2015) From gene trees to a dated allopolyploid network: Insights from the angiosperm 
genus viola (violaceae). Systematic biology 64( 1):84—101 

Pardi F, Scornavacca C (2015) Reconstructible phylogenetic networks: Do not distinguish the indistinguishable. PLoS Computational Biology 
ll(4):e 1004,135 

Semple C, Steel M (2003) Phylogenetics. Oxford University Press 

Willson S (2012) CSD homomorphisms between phylogenetic networks. IEEE/ACM Transactions on Computational Biology and Bioinformatics 
(TCBB) 9(4): 1128-1138 

Wu T, Zhang L (2011) Structural properties of the reconciliation space and their applications in enumerating nearly-optimal reconciliations between 
a gene tree and a species tree. BMC bioinformatics 12(Suppl 9):S7 

Zhang L, Ng YK, Wu T, Zheng Y (2011) Network model and efficient method for detecting relative duplications or horizontal gene transfers. In: 
Computational Advances in Bio and Medical Sciences (ICCABS), 2011 IEEE 1st International Conference on, IEEE, pp 214—219 



